2 research outputs found

    Efficient Noise Suppression for Robust Speech Recognition

    Get PDF
    Electrical EngineeringThis thesis addresses the issues of single microphone based noise estimation technique for speech recognition in noise environments. A lot of researches have been performed on the environmental noise estimation, however most of them require voice activity detector (VAD) for accurate estimation of noise characteristics. I propose two approaches for efficient noise estimation without VAD. The first approach aims at improving the conventional quantile-based noise estimation (QBNE). I fostered the QBNE by adjusting the quantile level (QL) according to the relative amount of added noise to the target speech. Basically, we assign two different QLs, i.e., binary levels, according to the measured statistical moment of log scale power spectrum at each frequency. The second approach is applying dual mixture parametric model in computing likelihoods of speech and non-speech classes. I used dual Gaussian mixture model (GMM) and Rayleigh mixture model (RMM) for the likelihoods. From the assumption that speech is generally uncorrelated to the environmental noises, the noise power spectrum can be estimated by using each mixture model parameter of speech absence class. I compared the proposed methods with the conventional QBNE and minimum statistics based method on a simple speech recognition task in various signal-to-noise ratio (SNR) levels. Based on the experimental results, the proposed methods are shown to be superior to the conventional methods.ope

    Variable quantile level based noise suppression for robust speech recognition

    No full text
    This paper addresses the issues of single microphone based noise estimation technique for speech recognition in noisy environments. Many researches have been performed on the environmental noise estimation; however, most of them require voice activity detection (VAD) for accurate estimation of noise characteristics. We propose an approach for efficient noise estimation without VAD, aiming at improving the conventional quantile-based noise estimation (QBNE). We fostered the QBNE by adjusting the quantile level according to the relative amount of added noise to the target speech. From the observation that the power spectral density (PSD) of noise is close to the Gaussian distribution, while that of speech is more narrowly populated, the level of additive noise is measured by the selected Gaussianity functions. We compared the proposed method with the conventional QBNE and minimum statistics based method on a simple speech recognition task in various SNR levels. The experimental results show that the proposed method is superior to the conventional methods
    corecore